Overview
Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 699 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 8 |
| Duplicate rows (%) | 1.1% |
| Total size in memory | 60.2 KiB |
| Average record size in memory | 88.2 B |
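The derived figures in this table follow from the raw counts. As a quick sanity check, the sketch below recomputes the duplicate percentage and the average record size from the values reported above, assuming 1 KiB = 1024 bytes (the report does not state the unit convention).

```python
# Figures copied from the "Dataset statistics" table.
n_rows = 699
n_duplicate_rows = 8
total_memory_kib = 60.2

# Share of rows flagged as duplicates, as a percentage.
duplicate_pct = 100 * n_duplicate_rows / n_rows

# Average bytes per record, assuming 1 KiB = 1024 bytes.
avg_record_bytes = total_memory_kib * 1024 / n_rows

print(f"{duplicate_pct:.1f}%")      # 1.1%
print(f"{avg_record_bytes:.1f} B")  # 88.2 B
```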
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 2 |
Alerts
| Alert | Type |
|---|---|
| Dataset has 8 (1.1%) duplicate rows | Duplicates |
| Bare Nuclei is highly overall correlated with Class | High correlation |
| Bland Chromatin is highly overall correlated with Class and 6 other fields | High correlation |
| Class is highly overall correlated with Bare Nuclei and 8 other fields | High correlation |
| Clump Thickness is highly overall correlated with Bland Chromatin and 6 other fields | High correlation |
| Marginal Adhesion is highly overall correlated with Bland Chromatin and 6 other fields | High correlation |
| Mitoses is highly overall correlated with Class and 2 other fields | High correlation |
| Normal Nucleoli is highly overall correlated with Bland Chromatin and 7 other fields | High correlation |
| Single Epithelial Cell Size is highly overall correlated with Bland Chromatin and 6 other fields | High correlation |
| Uniformity of Cell Shape is highly overall correlated with Bland Chromatin and 6 other fields | High correlation |
| Uniformity of Cell Size is highly overall correlated with Bland Chromatin and 7 other fields | High correlation |
Reproduction
| Analysis started | 2025-01-02 19:11:42.536518 |
|---|---|
| Analysis finished | 2025-01-02 19:11:50.493184 |
| Duration | 7.96 seconds |
| Software version | ydata-profiling v4.12.1 |
| Download configuration | config.json |
Variables
Sample code number
Real number (ℝ)
| Distinct | 645 |
|---|---|
| Distinct (%) | 92.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1071704.1 |
| Minimum | 61634 |
| Maximum | 13454352 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 61634 |
|---|---|
| 5-th percentile | 411453 |
| Q1 | 870688.5 |
| median | 1171710 |
| Q3 | 1238298 |
| 95-th percentile | 1333890.8 |
| Maximum | 13454352 |
| Range | 13392718 |
| Interquartile range (IQR) | 367609.5 |
Descriptive statistics
| Standard deviation | 617095.73 |
|---|---|
| Coefficient of variation (CV) | 0.57580794 |
| Kurtosis | 257.71716 |
| Mean | 1071704.1 |
| Median Absolute Deviation (MAD) | 104381 |
| Skewness | 13.675326 |
| Sum | 7.4912116 × 10⁸ |
| Variance | 3.8080714 × 10¹¹ |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1182404 | 6 | 0.9% |
| 1276091 | 5 | 0.7% |
| 1198641 | 3 | 0.4% |
| 1158247 | 2 | 0.3% |
| 1070935 | 2 | 0.3% |
| 733639 | 2 | 0.3% |
| 385103 | 2 | 0.3% |
| 1212422 | 2 | 0.3% |
| 798429 | 2 | 0.3% |
| 1173347 | 2 | 0.3% |
| Other values (635) | 671 | 96.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 61634 | 1 | 0.1% |
| 63375 | 1 | 0.1% |
| 76389 | 1 | 0.1% |
| 95719 | 1 | 0.1% |
| 128059 | 1 | 0.1% |
| 142932 | 1 | 0.1% |
| 144888 | 1 | 0.1% |
| 145447 | 1 | 0.1% |
| 160296 | 1 | 0.1% |
| 167528 | 1 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 13454352 | 1 | 0.1% |
| 8233704 | 1 | 0.1% |
| 1371920 | 1 | 0.1% |
| 1371026 | 1 | 0.1% |
| 1369821 | 1 | 0.1% |
| 1368882 | 1 | 0.1% |
| 1368273 | 1 | 0.1% |
| 1368267 | 1 | 0.1% |
| 1365328 | 1 | 0.1% |
| 1365075 | 1 | 0.1% |
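The quantile table and the extreme values suggest that a couple of sample code numbers sit far above the bulk of the data, which would be consistent with the large skewness and kurtosis reported. As a rough check, the sketch below applies Tukey's 1.5 × IQR rule to the ten largest values listed in the report. The rule is chosen here purely for illustration and is not something the report itself computes; since the column appears to be an identifier, such "outliers" may simply be unusual IDs.

```python
# Quartiles as reported in the "Quantile statistics" table.
q1, q3 = 870688.5, 1238298.0
iqr = q3 - q1  # 367609.5, matching the reported IQR

# Tukey fences: points beyond 1.5 * IQR from the quartiles are flagged.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# The ten largest sample code numbers listed in the report.
largest = [13454352, 8233704, 1371920, 1371026, 1369821,
           1368882, 1368273, 1368267, 1365328, 1365075]

high_outliers = [v for v in largest if v > upper_fence]
print(iqr, upper_fence, high_outliers)
```

Under these assumptions only the two largest values fall above the upper fence.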
Clump Thickness
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.4177396 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.8157407 |
|---|---|
| Coefficient of variation (CV) | 0.63737135 |
| Kurtosis | -0.62371541 |
| Mean | 4.4177396 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.59285853 |
| Sum | 3088 |
| Variance | 7.9283955 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 145 | 20.7% |
| 5 | 130 | 18.6% |
| 3 | 108 | 15.5% |
| 4 | 80 | 11.4% |
| 10 | 69 | 9.9% |
| 2 | 50 | 7.2% |
| 8 | 46 | 6.6% |
| 6 | 34 | 4.9% |
| 7 | 23 | 3.3% |
| 9 | 14 | 2.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 145 | 20.7% |
| 2 | 50 | 7.2% |
| 3 | 108 | 15.5% |
| 4 | 80 | 11.4% |
| 5 | 130 | 18.6% |
| 6 | 34 | 4.9% |
| 7 | 23 | 3.3% |
| 8 | 46 | 6.6% |
| 9 | 14 | 2.0% |
| 10 | 69 | 9.9% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 69 | 9.9% |
| 9 | 14 | 2.0% |
| 8 | 46 | 6.6% |
| 7 | 23 | 3.3% |
| 6 | 34 | 4.9% |
| 5 | 130 | 18.6% |
| 4 | 80 | 11.4% |
| 3 | 108 | 15.5% |
| 2 | 50 | 7.2% |
| 1 | 145 | 20.7% |
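Because Clump Thickness takes only ten distinct values, its descriptive statistics can be recomputed directly from the frequency table. The sketch below does this with the counts from the report; it assumes the variance is the sample variance (dividing by n − 1), which is the convention that reproduces the reported figure.

```python
import math

# Value -> count, copied from the Clump Thickness frequency table.
counts = {1: 145, 2: 50, 3: 108, 4: 80, 5: 130,
          6: 34, 7: 23, 8: 46, 9: 14, 10: 69}

n = sum(counts.values())                       # number of observations
total = sum(v * c for v, c in counts.items())  # the reported "Sum"
mean = total / n

# Sample variance: squared deviations weighted by count, divided by n - 1.
ss = sum(c * (v - mean) ** 2 for v, c in counts.items())
variance = ss / (n - 1)
std = math.sqrt(variance)
cv = std / mean  # coefficient of variation

print(n, total, round(mean, 7), round(variance, 7), round(cv, 8))
```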
Uniformity of Cell Size
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.1344778 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 5 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.0514591 |
|---|---|
| Coefficient of variation (CV) | 0.97351434 |
| Kurtosis | 0.098802885 |
| Mean | 3.1344778 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.2331366 |
| Sum | 2191 |
| Variance | 9.3114027 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 384 | 54.9% |
| 10 | 67 | 9.6% |
| 3 | 52 | 7.4% |
| 2 | 45 | 6.4% |
| 4 | 40 | 5.7% |
| 5 | 30 | 4.3% |
| 8 | 29 | 4.1% |
| 6 | 27 | 3.9% |
| 7 | 19 | 2.7% |
| 9 | 6 | 0.9% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 384 | 54.9% |
| 2 | 45 | 6.4% |
| 3 | 52 | 7.4% |
| 4 | 40 | 5.7% |
| 5 | 30 | 4.3% |
| 6 | 27 | 3.9% |
| 7 | 19 | 2.7% |
| 8 | 29 | 4.1% |
| 9 | 6 | 0.9% |
| 10 | 67 | 9.6% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 67 | 9.6% |
| 9 | 6 | 0.9% |
| 8 | 29 | 4.1% |
| 7 | 19 | 2.7% |
| 6 | 27 | 3.9% |
| 5 | 30 | 4.3% |
| 4 | 40 | 5.7% |
| 3 | 52 | 7.4% |
| 2 | 45 | 6.4% |
| 1 | 384 | 54.9% |
Uniformity of Cell Shape
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2074392 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 5 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.9719128 |
|---|---|
| Coefficient of variation (CV) | 0.9265687 |
| Kurtosis | 0.00701098 |
| Mean | 3.2074392 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.1618592 |
| Sum | 2242 |
| Variance | 8.8322655 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 353 | 50.5% |
| 2 | 59 | 8.4% |
| 10 | 58 | 8.3% |
| 3 | 56 | 8.0% |
| 4 | 44 | 6.3% |
| 5 | 34 | 4.9% |
| 7 | 30 | 4.3% |
| 6 | 30 | 4.3% |
| 8 | 28 | 4.0% |
| 9 | 7 | 1.0% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 353 | 50.5% |
| 2 | 59 | 8.4% |
| 3 | 56 | 8.0% |
| 4 | 44 | 6.3% |
| 5 | 34 | 4.9% |
| 6 | 30 | 4.3% |
| 7 | 30 | 4.3% |
| 8 | 28 | 4.0% |
| 9 | 7 | 1.0% |
| 10 | 58 | 8.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 58 | 8.3% |
| 9 | 7 | 1.0% |
| 8 | 28 | 4.0% |
| 7 | 30 | 4.3% |
| 6 | 30 | 4.3% |
| 5 | 34 | 4.9% |
| 4 | 44 | 6.3% |
| 3 | 56 | 8.0% |
| 2 | 59 | 8.4% |
| 1 | 353 | 50.5% |
Marginal Adhesion
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.806867 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.8553792 |
|---|---|
| Coefficient of variation (CV) | 1.0172834 |
| Kurtosis | 0.98794707 |
| Mean | 2.806867 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.5244681 |
| Sum | 1962 |
| Variance | 8.1531906 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 407 | 58.2% |
| 3 | 58 | 8.3% |
| 2 | 58 | 8.3% |
| 10 | 55 | 7.9% |
| 4 | 33 | 4.7% |
| 8 | 25 | 3.6% |
| 5 | 23 | 3.3% |
| 6 | 22 | 3.1% |
| 7 | 13 | 1.9% |
| 9 | 5 | 0.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 407 | 58.2% |
| 2 | 58 | 8.3% |
| 3 | 58 | 8.3% |
| 4 | 33 | 4.7% |
| 5 | 23 | 3.3% |
| 6 | 22 | 3.1% |
| 7 | 13 | 1.9% |
| 8 | 25 | 3.6% |
| 9 | 5 | 0.7% |
| 10 | 55 | 7.9% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 55 | 7.9% |
| 9 | 5 | 0.7% |
| 8 | 25 | 3.6% |
| 7 | 13 | 1.9% |
| 6 | 22 | 3.1% |
| 5 | 23 | 3.3% |
| 4 | 33 | 4.7% |
| 3 | 58 | 8.3% |
| 2 | 58 | 8.3% |
| 1 | 407 | 58.2% |
Single Epithelial Cell Size
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2160229 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 8 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.2142999 |
|---|---|
| Coefficient of variation (CV) | 0.68852118 |
| Kurtosis | 2.1690664 |
| Mean | 3.2160229 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.7121718 |
| Sum | 2248 |
| Variance | 4.903124 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 386 | 55.2% |
| 3 | 72 | 10.3% |
| 4 | 48 | 6.9% |
| 1 | 47 | 6.7% |
| 6 | 41 | 5.9% |
| 5 | 39 | 5.6% |
| 10 | 31 | 4.4% |
| 8 | 21 | 3.0% |
| 7 | 12 | 1.7% |
| 9 | 2 | 0.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 47 | 6.7% |
| 2 | 386 | 55.2% |
| 3 | 72 | 10.3% |
| 4 | 48 | 6.9% |
| 5 | 39 | 5.6% |
| 6 | 41 | 5.9% |
| 7 | 12 | 1.7% |
| 8 | 21 | 3.0% |
| 9 | 2 | 0.3% |
| 10 | 31 | 4.4% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 31 | 4.4% |
| 9 | 2 | 0.3% |
| 8 | 21 | 3.0% |
| 7 | 12 | 1.7% |
| 6 | 41 | 5.9% |
| 5 | 39 | 5.6% |
| 4 | 48 | 6.9% |
| 3 | 72 | 10.3% |
| 2 | 386 | 55.2% |
| 1 | 47 | 6.7% |
Bare Nuclei
Categorical
High correlation 
| Distinct | 11 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.6 KiB |
| Value | Count |
|---|---|
| 1 | 402 |
| 10 | 132 |
| 2 | 30 |
| 5 | 30 |
| 3 | 28 |
| Other values (6) | 77 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.1888412 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 10 |
| 3rd row | 2 |
| 4th row | 4 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 402 | 57.5% |
| 10 | 132 | 18.9% |
| 2 | 30 | 4.3% |
| 5 | 30 | 4.3% |
| 3 | 28 | 4.0% |
| 8 | 21 | 3.0% |
| 4 | 19 | 2.7% |
| ? | 16 | 2.3% |
| 9 | 9 | 1.3% |
| 7 | 8 | 1.1% |
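Bare Nuclei is typed as Categorical even though ten of its eleven distinct values are the integers 1 to 10; the remaining value, `?`, occurs 16 times. That pattern is consistent with `?` being a missing-value placeholder, although the report does not say so and counts 0 missing cells. Under that assumption, a minimal sketch of converting the raw strings before profiling might look like this (the function name `parse_bare_nuclei` is hypothetical):

```python
from typing import Optional

def parse_bare_nuclei(raw: str) -> Optional[int]:
    """Convert a raw Bare Nuclei string to an int, or None for '?'.

    Assumes '?' marks a missing value; the report does not confirm this.
    """
    raw = raw.strip()
    return None if raw == "?" else int(raw)

# The first five sample values shown in the report, plus a '?' entry.
raw_values = ["1", "10", "2", "4", "1", "?"]
parsed = [parse_bare_nuclei(v) for v in raw_values]
print(parsed)  # [1, 10, 2, 4, 1, None]
```

With the placeholder mapped to a proper missing value, a profiler would likely treat the column as numeric and report 16 missing cells instead of an extra category.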
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 402 | 57.5% |
| 10 | 132 | 18.9% |
| 2 | 30 | 4.3% |
| 5 | 30 | 4.3% |
| 3 | 28 | 4.0% |
| 8 | 21 | 3.0% |
| 4 | 19 | 2.7% |
| ? | 16 | 2.3% |
| 9 | 9 | 1.3% |
| 7 | 8 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 534 | 64.3% |
| 0 | 132 | 15.9% |
| 2 | 30 | 3.6% |
| 5 | 30 | 3.6% |
| 3 | 28 | 3.4% |
| 8 | 21 | 2.5% |
| 4 | 19 | 2.3% |
| ? | 16 | 1.9% |
| 9 | 9 | 1.1% |
| 7 | 8 | 1.0% |
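The character counts differ from the value counts because the two-character value `10` contributes both a `1` and a `0`. The sketch below rebuilds the character tally and the mean length from the value counts in the report; the count for the value `6` is inferred from the per-category character table, since it falls outside the top ten common values.

```python
from collections import Counter

# Value -> count for Bare Nuclei, from the report's frequency tables.
value_counts = {"1": 402, "10": 132, "2": 30, "5": 30, "3": 28, "8": 21,
                "4": 19, "?": 16, "9": 9, "7": 8, "6": 4}

# Each occurrence of a value contributes all of its characters.
char_counts = Counter()
for value, count in value_counts.items():
    for ch in value:
        char_counts[ch] += count

total_chars = sum(char_counts.values())
mean_length = total_chars / sum(value_counts.values())

print(char_counts["1"], char_counts["0"], total_chars)  # 534 132 831
print(round(mean_length, 7))                            # 1.1888412
```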
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 815 | 98.1% |
| Other Punctuation | 16 | 1.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 534 | 65.5% |
| 0 | 132 | 16.2% |
| 2 | 30 | 3.7% |
| 5 | 30 | 3.7% |
| 3 | 28 | 3.4% |
| 8 | 21 | 2.6% |
| 4 | 19 | 2.3% |
| 9 | 9 | 1.1% |
| 7 | 8 | 1.0% |
| 6 | 4 | 0.5% |
Other Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| ? | 16 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 831 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 534 | 64.3% |
| 0 | 132 | 15.9% |
| 2 | 30 | 3.6% |
| 5 | 30 | 3.6% |
| 3 | 28 | 3.4% |
| 8 | 21 | 2.5% |
| 4 | 19 | 2.3% |
| ? | 16 | 1.9% |
| 9 | 9 | 1.1% |
| 7 | 8 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 831 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 534 | 64.3% |
| 0 | 132 | 15.9% |
| 2 | 30 | 3.6% |
| 5 | 30 | 3.6% |
| 3 | 28 | 3.4% |
| 8 | 21 | 2.5% |
| 4 | 19 | 2.3% |
| ? | 16 | 1.9% |
| 9 | 9 | 1.1% |
| 7 | 8 | 1.0% |
Bland Chromatin
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4377682 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 8 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.4383643 |
|---|---|
| Coefficient of variation (CV) | 0.70928698 |
| Kurtosis | 0.18462131 |
| Mean | 3.4377682 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.0999691 |
| Sum | 2403 |
| Variance | 5.9456202 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 166 | 23.7% |
| 3 | 165 | 23.6% |
| 1 | 152 | 21.7% |
| 7 | 73 | 10.4% |
| 4 | 40 | 5.7% |
| 5 | 34 | 4.9% |
| 8 | 28 | 4.0% |
| 10 | 20 | 2.9% |
| 9 | 11 | 1.6% |
| 6 | 10 | 1.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 152 | 21.7% |
| 2 | 166 | 23.7% |
| 3 | 165 | 23.6% |
| 4 | 40 | 5.7% |
| 5 | 34 | 4.9% |
| 6 | 10 | 1.4% |
| 7 | 73 | 10.4% |
| 8 | 28 | 4.0% |
| 9 | 11 | 1.6% |
| 10 | 20 | 2.9% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 20 | 2.9% |
| 9 | 11 | 1.6% |
| 8 | 28 | 4.0% |
| 7 | 73 | 10.4% |
| 6 | 10 | 1.4% |
| 5 | 34 | 4.9% |
| 4 | 40 | 5.7% |
| 3 | 165 | 23.6% |
| 2 | 166 | 23.7% |
| 1 | 152 | 21.7% |
Normal Nucleoli
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.8669528 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 3.0536339 |
|---|---|
| Coefficient of variation (CV) | 1.0651148 |
| Kurtosis | 0.47426868 |
| Mean | 2.8669528 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.4222613 |
| Sum | 2004 |
| Variance | 9.32468 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 443 | 63.4% |
| 10 | 61 | 8.7% |
| 3 | 44 | 6.3% |
| 2 | 36 | 5.2% |
| 8 | 24 | 3.4% |
| 6 | 22 | 3.1% |
| 5 | 19 | 2.7% |
| 4 | 18 | 2.6% |
| 7 | 16 | 2.3% |
| 9 | 16 | 2.3% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 443 | 63.4% |
| 2 | 36 | 5.2% |
| 3 | 44 | 6.3% |
| 4 | 18 | 2.6% |
| 5 | 19 | 2.7% |
| 6 | 22 | 3.1% |
| 7 | 16 | 2.3% |
| 8 | 24 | 3.4% |
| 9 | 16 | 2.3% |
| 10 | 61 | 8.7% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 61 | 8.7% |
| 9 | 16 | 2.3% |
| 8 | 24 | 3.4% |
| 7 | 16 | 2.3% |
| 6 | 22 | 3.1% |
| 5 | 19 | 2.7% |
| 4 | 18 | 2.6% |
| 3 | 44 | 6.3% |
| 2 | 36 | 5.2% |
| 1 | 443 | 63.4% |
Mitoses
Real number (ℝ)
High correlation 
| Distinct | 9 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.5894134 |
| Minimum | 1 |
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 5 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.7150779 |
|---|---|
| Coefficient of variation (CV) | 1.0790634 |
| Kurtosis | 12.657878 |
| Mean | 1.5894134 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.5606578 |
| Sum | 1111 |
| Variance | 2.9414923 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 579 | 82.8% |
| 2 | 35 | 5.0% |
| 3 | 33 | 4.7% |
| 10 | 14 | 2.0% |
| 4 | 12 | 1.7% |
| 7 | 9 | 1.3% |
| 8 | 8 | 1.1% |
| 5 | 6 | 0.9% |
| 6 | 3 | 0.4% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 579 | 82.8% |
| 2 | 35 | 5.0% |
| 3 | 33 | 4.7% |
| 4 | 12 | 1.7% |
| 5 | 6 | 0.9% |
| 6 | 3 | 0.4% |
| 7 | 9 | 1.3% |
| 8 | 8 | 1.1% |
| 10 | 14 | 2.0% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 14 | 2.0% |
| 8 | 8 | 1.1% |
| 7 | 9 | 1.3% |
| 6 | 3 | 0.4% |
| 5 | 6 | 0.9% |
| 4 | 12 | 1.7% |
| 3 | 33 | 4.7% |
| 2 | 35 | 5.0% |
| 1 | 579 | 82.8% |
Class
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.6 KiB |
| Value | Count |
|---|---|
| benign | 458 |
| malignant | 241 |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 7.0343348 |
| Min length | 6 |
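The length statistics for Class follow directly from the two labels and their counts elsewhere in the report (458 benign, 241 malignant). A small sketch of that arithmetic:

```python
# Class label -> count, as given in the report's Common Values table.
class_counts = {"benign": 458, "malignant": 241}

n = sum(class_counts.values())
total_chars = sum(len(label) * c for label, c in class_counts.items())
mean_length = total_chars / n

# Median length: sort one length per observation and take the middle one.
lengths = sorted(len(label) for label, c in class_counts.items() for _ in range(c))
median_length = lengths[n // 2]  # n is odd (699), so there is a single middle element

print(total_chars, round(mean_length, 7), median_length)  # 4917 7.0343348 6
```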
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | benign |
|---|---|
| 2nd row | benign |
| 3rd row | benign |
| 4th row | benign |
| 5th row | benign |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| benign | 458 | 65.5% |
| malignant | 241 | 34.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
|---|---|---|
| benign | 458 | 65.5% |
| malignant | 241 | 34.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| n | 1398 | 28.4% |
| g | 699 | 14.2% |
| i | 699 | 14.2% |
| a | 482 | 9.8% |
| b | 458 | 9.3% |
| e | 458 | 9.3% |
| m | 241 | 4.9% |
| l | 241 | 4.9% |
| t | 241 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Lowercase Letter | 4917 | 100.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| n | 1398 | 28.4% |
| g | 699 | 14.2% |
| i | 699 | 14.2% |
| a | 482 | 9.8% |
| b | 458 | 9.3% |
| e | 458 | 9.3% |
| m | 241 | 4.9% |
| l | 241 | 4.9% |
| t | 241 | 4.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Latin | 4917 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
|---|---|---|
| n | 1398 | 28.4% |
| g | 699 | 14.2% |
| i | 699 | 14.2% |
| a | 482 | 9.8% |
| b | 458 | 9.3% |
| e | 458 | 9.3% |
| m | 241 | 4.9% |
| l | 241 | 4.9% |
| t | 241 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 4917 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| n | 1398 | 28.4% |
| g | 699 | 14.2% |
| i | 699 | 14.2% |
| a | 482 | 9.8% |
| b | 458 | 9.3% |
| e | 458 | 9.3% |
| m | 241 | 4.9% |
| l | 241 | 4.9% |
| t | 241 | 4.9% |
Interactions
Correlations
| | Bare Nuclei | Bland Chromatin | Class | Clump Thickness | Marginal Adhesion | Mitoses | Normal Nucleoli | Sample code number | Single Epithelial Cell Size | Uniformity of Cell Shape | Uniformity of Cell Size |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Bare Nuclei | 1.000 | 0.255 | 0.834 | 0.223 | 0.263 | 0.194 | 0.251 | 0.000 | 0.270 | 0.278 | 0.287 |
| Bland Chromatin | 0.255 | 1.000 | 0.804 | 0.538 | 0.625 | 0.387 | 0.662 | -0.096 | 0.640 | 0.692 | 0.719 |
| Class | 0.834 | 0.804 | 1.000 | 0.738 | 0.738 | 0.519 | 0.768 | 0.000 | 0.791 | 0.860 | 0.875 |
| Clump Thickness | 0.223 | 0.538 | 0.738 | 1.000 | 0.542 | 0.419 | 0.570 | -0.004 | 0.584 | 0.664 | 0.666 |
| Marginal Adhesion | 0.263 | 0.625 | 0.738 | 0.542 | 1.000 | 0.447 | 0.634 | -0.050 | 0.668 | 0.712 | 0.743 |
| Mitoses | 0.194 | 0.387 | 0.519 | 0.419 | 0.447 | 1.000 | 0.504 | -0.075 | 0.480 | 0.473 | 0.509 |
| Normal Nucleoli | 0.251 | 0.662 | 0.768 | 0.570 | 0.634 | 0.504 | 1.000 | -0.071 | 0.706 | 0.725 | 0.757 |
| Sample code number | 0.000 | -0.096 | 0.000 | -0.004 | -0.050 | -0.075 | -0.071 | 1.000 | -0.087 | -0.060 | -0.043 |
| Single Epithelial Cell Size | 0.270 | 0.640 | 0.791 | 0.584 | 0.668 | 0.480 | 0.706 | -0.087 | 1.000 | 0.759 | 0.787 |
| Uniformity of Cell Shape | 0.278 | 0.692 | 0.860 | 0.664 | 0.712 | 0.473 | 0.725 | -0.060 | 0.759 | 1.000 | 0.892 |
| Uniformity of Cell Size | 0.287 | 0.719 | 0.875 | 0.666 | 0.743 | 0.509 | 0.757 | -0.043 | 0.787 | 0.892 | 1.000 |
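The "High correlation" alerts in the overview appear to follow from this matrix: counting, for each variable, the other variables whose coefficient exceeds 0.5 reproduces the number of fields named in every alert. The sketch below illustrates that reading. The 0.5 cut-off is an assumption inferred from the numbers, not a value stated in the report, and `high_correlations` is a hypothetical helper name.

```python
# Column order of the correlation matrix in the report.
names = ["Bare Nuclei", "Bland Chromatin", "Class", "Clump Thickness",
         "Marginal Adhesion", "Mitoses", "Normal Nucleoli",
         "Sample code number", "Single Epithelial Cell Size",
         "Uniformity of Cell Shape", "Uniformity of Cell Size"]

# Rows copied from the report, in the same order as `names`.
matrix = [
    [1.000, 0.255, 0.834, 0.223, 0.263, 0.194, 0.251, 0.000, 0.270, 0.278, 0.287],
    [0.255, 1.000, 0.804, 0.538, 0.625, 0.387, 0.662, -0.096, 0.640, 0.692, 0.719],
    [0.834, 0.804, 1.000, 0.738, 0.738, 0.519, 0.768, 0.000, 0.791, 0.860, 0.875],
    [0.223, 0.538, 0.738, 1.000, 0.542, 0.419, 0.570, -0.004, 0.584, 0.664, 0.666],
    [0.263, 0.625, 0.738, 0.542, 1.000, 0.447, 0.634, -0.050, 0.668, 0.712, 0.743],
    [0.194, 0.387, 0.519, 0.419, 0.447, 1.000, 0.504, -0.075, 0.480, 0.473, 0.509],
    [0.251, 0.662, 0.768, 0.570, 0.634, 0.504, 1.000, -0.071, 0.706, 0.725, 0.757],
    [0.000, -0.096, 0.000, -0.004, -0.050, -0.075, -0.071, 1.000, -0.087, -0.060, -0.043],
    [0.270, 0.640, 0.791, 0.584, 0.668, 0.480, 0.706, -0.087, 1.000, 0.759, 0.787],
    [0.278, 0.692, 0.860, 0.664, 0.712, 0.473, 0.725, -0.060, 0.759, 1.000, 0.892],
    [0.287, 0.719, 0.875, 0.666, 0.743, 0.509, 0.757, -0.043, 0.787, 0.892, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off; not stated in the report

def high_correlations(threshold=THRESHOLD):
    """Map each variable to the other variables with |coefficient| above the threshold."""
    result = {}
    for i, row in enumerate(matrix):
        partners = [names[j] for j, r in enumerate(row)
                    if j != i and abs(r) > threshold]
        if partners:
            result[names[i]] = partners
    return result

alerts = high_correlations()
for name, partners in alerts.items():
    print(f"{name}: {len(partners)} field(s)")
```

Sample code number has no coefficient above the assumed threshold, which matches its absence from the alerts.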
Missing values
Sample
First rows
| | Sample code number | Clump Thickness | Uniformity of Cell Size | Uniformity of Cell Shape | Marginal Adhesion | Single Epithelial Cell Size | Bare Nuclei | Bland Chromatin | Normal Nucleoli | Mitoses | Class |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1000025 | 5 | 1 | 1 | 1 | 2 | 1 | 3 | 1 | 1 | benign |
| 1 | 1002945 | 5 | 4 | 4 | 5 | 7 | 10 | 3 | 2 | 1 | benign |
| 2 | 1015425 | 3 | 1 | 1 | 1 | 2 | 2 | 3 | 1 | 1 | benign |
| 3 | 1016277 | 6 | 8 | 8 | 1 | 3 | 4 | 3 | 7 | 1 | benign |
| 4 | 1017023 | 4 | 1 | 1 | 3 | 2 | 1 | 3 | 1 | 1 | benign |
| 5 | 1017122 | 8 | 10 | 10 | 8 | 7 | 10 | 9 | 7 | 1 | malignant |
| 6 | 1018099 | 1 | 1 | 1 | 1 | 2 | 10 | 3 | 1 | 1 | benign |
| 7 | 1018561 | 2 | 1 | 2 | 1 | 2 | 1 | 3 | 1 | 1 | benign |
| 8 | 1033078 | 2 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 5 | benign |
| 9 | 1033078 | 4 | 2 | 1 | 1 | 2 | 1 | 2 | 1 | 1 | benign |
Last rows
| | Sample code number | Clump Thickness | Uniformity of Cell Size | Uniformity of Cell Shape | Marginal Adhesion | Single Epithelial Cell Size | Bare Nuclei | Bland Chromatin | Normal Nucleoli | Mitoses | Class |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 689 | 654546 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 8 | benign |
| 690 | 654546 | 1 | 1 | 1 | 3 | 2 | 1 | 1 | 1 | 1 | benign |
| 691 | 695091 | 5 | 10 | 10 | 5 | 4 | 5 | 4 | 4 | 1 | malignant |
| 692 | 714039 | 3 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | benign |
| 693 | 763235 | 3 | 1 | 1 | 1 | 2 | 1 | 2 | 1 | 2 | benign |
| 694 | 776715 | 3 | 1 | 1 | 1 | 3 | 2 | 1 | 1 | 1 | benign |
| 695 | 841769 | 2 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | benign |
| 696 | 888820 | 5 | 10 | 10 | 3 | 7 | 3 | 8 | 10 | 2 | malignant |
| 697 | 897471 | 4 | 8 | 6 | 4 | 3 | 4 | 10 | 6 | 1 | malignant |
| 698 | 897471 | 4 | 8 | 8 | 5 | 4 | 5 | 10 | 4 | 1 | malignant |
Duplicate rows
Most frequently occurring
| | Sample code number | Clump Thickness | Uniformity of Cell Size | Uniformity of Cell Shape | Marginal Adhesion | Single Epithelial Cell Size | Bare Nuclei | Bland Chromatin | Normal Nucleoli | Mitoses | Class | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 320675 | 3 | 3 | 5 | 2 | 3 | 10 | 7 | 1 | 1 | malignant | 2 |
| 1 | 466906 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | 1 | 1 | benign | 2 |
| 2 | 704097 | 1 | 1 | 1 | 1 | 1 | 1 | 2 | 1 | 1 | benign | 2 |
| 3 | 1100524 | 6 | 10 | 10 | 2 | 8 | 10 | 7 | 3 | 3 | malignant | 2 |
| 4 | 1116116 | 9 | 10 | 10 | 1 | 10 | 8 | 3 | 3 | 1 | malignant | 2 |
| 5 | 1198641 | 3 | 1 | 1 | 1 | 2 | 1 | 3 | 1 | 1 | benign | 2 |
| 6 | 1218860 | 1 | 1 | 1 | 1 | 1 | 1 | 3 | 1 | 1 | benign | 2 |
| 7 | 1321942 | 5 | 1 | 1 | 1 | 2 | 1 | 3 | 1 | 1 | benign | 2 |
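A row counts as a duplicate here when every field, including the sample code number, matches another row. As an illustration of that idea (not the profiler's own implementation), the sketch below counts identical rows with a `Counter`, using one duplicated record from the table above and the first row of the sample for contrast.

```python
from collections import Counter

# Rows in the report's column order (sample code number first, class last).
# The first two are one of the duplicated records listed above; the third is
# the first row of the sample, included as a non-duplicate.
rows = [
    (466906, 1, 1, 1, 1, 2, "1", 1, 1, 1, "benign"),
    (466906, 1, 1, 1, 1, 2, "1", 1, 1, 1, "benign"),
    (1000025, 5, 1, 1, 1, 2, "1", 3, 1, 1, "benign"),
]

# Count identical rows; anything seen more than once is a duplicate group.
occurrences = Counter(rows)
duplicates = {row: n for row, n in occurrences.items() if n > 1}

for row, n in duplicates.items():
    print(row[0], n)  # 466906 2
```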